An interesting question about learning is how an embedded agent can improve its performance while acting in a
complex dynamical environment. Supervised learning is not feasible because precise knowledge of the dynamical
environment and of the agent's correct responses is not available a priori. As case studies of this control problem, we
chose two types of ball-playing games: shooting in basketball and squash tennis. Both cases require a robust and
efficient algorithm. We applied a stochastic reinforcement learning method with the temporal difference
(TD) algorithm for control and prediction. The method proved successful.
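
To make the prediction side of the TD algorithm concrete, the following is a minimal sketch of tabular TD(0) value estimation. It does not reproduce the ball-game controllers described above: the random-walk task, the function name td0_random_walk, and the parameter values (alpha, gamma, episodes, seed) are all assumptions chosen only for illustration.

```python
import random


def td0_random_walk(n_states=5, episodes=2000, alpha=0.05, gamma=1.0, seed=0):
    """Estimate state values with tabular TD(0) on a toy random walk.

    Hypothetical illustration: states 1..n_states are non-terminal, and
    states 0 and n_states + 1 are terminal. Reaching the right terminal
    state gives reward 1; every other transition gives reward 0.
    """
    rng = random.Random(seed)
    values = [0.0] * (n_states + 2)  # terminal values stay at 0

    for _ in range(episodes):
        state = (n_states + 1) // 2  # start each episode in the middle
        while 0 < state < n_states + 1:
            next_state = state + rng.choice((-1, 1))
            reward = 1.0 if next_state == n_states + 1 else 0.0
            # TD(0) update: move V(s) toward the bootstrapped target
            # r + gamma * V(s'), without waiting for the episode to end.
            td_error = reward + gamma * values[next_state] - values[state]
            values[state] += alpha * td_error
            state = next_state

    return values[1:-1]  # estimates for the non-terminal states only


if __name__ == "__main__":
    print(td0_random_walk())
```

In this toy task no teacher supplies the correct value of a state; each estimate is corrected only by the difference between successive predictions, which is the property that makes TD methods usable when supervised targets are unavailable.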